Abstract

We give the first constant-factor approximation algorithm for Sparsest-Cut with general
demands in bounded treewidth graphs. In contrast to previous algorithms, which rely on the
flow-cut gap and/or metric embeddings, our approach exploits the Sherali-Adams hierarchy of 
linear programming relaxations. 



1 Introduction 



The Sparsest-Cut problem is one of the most famous graph optimization problems. The problem 
has been studied extensively due to the central role it plays in several respects. First, it represents 
a basic graph partitioning task that arises in several contexts, such as divide-and-conquer graph
algorithms (see e.g. [LR99, Shm97] and [Vaz01, Chapter 21]). Second, it is intimately related to other
graph parameters, such as flows, edge-expansion, conductance, spectral gap and bisection-width. 
Third, there are several deep technical links between Sparsest-Cut and two seemingly unrelated 
concepts, the Unique Games Conjecture and Metric Embeddings. 

Given that Sparsest-Cut is known to be NP-hard [MS86], the problem has been studied extensively
from the perspective of polynomial-time approximation algorithms. Despite significant efforts
and progress in the last two decades, we are still quite far from determining the approximability of 
Sparsest-Cut. This is true not only for general graphs, but also for several important graph families, 
such as planar graphs or bounded treewidth graphs. The latter family is the focus of this paper; 
we shall return to it after setting up some notation and defining the problem formally. 



Problem definition. For a graph $G = (V, E)$ we let $n = |V|$. For $S \subseteq V$, the cutset $(S, \bar{S}) \subseteq V \times V$
is the set of unordered pairs with exactly one endpoint in $S$, i.e. $\{\{u, v\} \in V \times V : u \in S, v \notin S\}$.
In the Sparsest-Cut problem (with general demands), the input is a graph $G = (V, E)$ with edge
capacities $\mathrm{cap} : E \to \mathbb{R}_{\ge 0}$ and a set of demand pairs, $D = (\{s_1, t_1\}, \ldots, \{s_k, t_k\})$, with a demand
function $\mathrm{dem} : D \to \mathbb{R}_{\ge 0}$. The goal is to find $S \subseteq V$ (a cut of $G$) that minimizes the ratio

$$\frac{\sum_{(u,v) \in (S,\bar{S}) \cap E} \mathrm{cap}(u,v)}{\sum_{(u,v) \in (S,\bar{S}) \cap D} \mathrm{dem}(u,v)}.$$

The demand function dem is often set to $\mathrm{dem}(s,t) = 1$ for all $\{s,t\} \in D$. The special case where,
in addition to this, the demand set $D$ includes all vertex pairs is referred to as uniform demands.
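
To make the objective concrete, the following Python sketch evaluates the ratio of cut capacity to cut demand for a given vertex set and finds the best cut by exhaustive search. It is an illustration only and not the algorithm of this paper: the function names `cut_ratio` and `brute_force_sparsest_cut`, the dictionary encoding of cap and dem, and the toy instance are assumptions introduced here, and the search takes exponential time.

```python
from itertools import combinations

def cut_ratio(S, cap, dem):
    """Ratio of cut capacity to cut demand for the vertex set S.

    cap and dem map frozenset({u, v}) to a non-negative number
    (a hypothetical encoding chosen for this sketch).
    Returns float('inf') when S separates no demand pair.
    """
    S = set(S)
    def separated(pair):
        return len(pair & S) == 1  # exactly one endpoint inside S
    cut_cap = sum(c for e, c in cap.items() if separated(e))
    cut_dem = sum(d for p, d in dem.items() if separated(p))
    return cut_cap / cut_dem if cut_dem > 0 else float("inf")

def brute_force_sparsest_cut(vertices, cap, dem):
    """Try every nonempty proper subset S; only sensible for tiny inputs."""
    vertices = list(vertices)
    best_ratio, best_S = float("inf"), None
    for size in range(1, len(vertices)):
        for S in combinations(vertices, size):
            ratio = cut_ratio(S, cap, dem)
            if ratio < best_ratio:
                best_ratio, best_S = ratio, frozenset(S)
    return best_ratio, best_S

# Toy instance (assumed for illustration): path a-b-c, one demand pair {a, c}.
cap = {frozenset("ab"): 1.0, frozenset("bc"): 3.0}
dem = {frozenset("ac"): 2.0}
ratio, S = brute_force_sparsest_cut("abc", cap, dem)
```

On this toy instance the cheapest way to separate the demand pair is to cut the edge of capacity 1, so the sketch returns the ratio 1/2.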

Treewidth. Let $G = (V, E)$ be a graph. A tree decomposition of $G = (V, E)$ is a pair $(\mathcal{B}, T)$
where $\mathcal{B} = \{B_1, \ldots, B_m\}$ is a family of subsets $B_i \subseteq V$ called bags, and $T$ is a tree whose nodes
are the bags $B_i$, satisfying the following properties: (i) $V = \bigcup_i B_i$; (ii) For every edge $(u, v) \in E$,
there is a bag $B_j$ that contains both $u, v$; and (iii) For each $v \in V$, all the bags $B_i$ containing $v$
form a connected subtree of $T$. The width of the tree decomposition is $\max_i |B_i| - 1$. The treewidth
of $G$, denoted $\mathrm{tw}(G)$, is the smallest width among all tree decompositions of $G$. The pathwidth
of $G$ is defined similarly, except that $T$ is restricted to be a path; thus, it is at least $\mathrm{tw}(G)$. It is
straightforward to see that every graph $G$ excludes as a minor the complete graph on $\mathrm{tw}(G) + 2$
vertices. Thus, the family of graphs of treewidth $r$ contains the family of graphs with pathwidth
$r$, and is contained in the family of graphs excluding $K_{r+2}$ as a minor (here $K_{r+2}$ refers to the
complete graph on $r + 2$ vertices).
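
The three defining properties can be checked mechanically. The sketch below is a minimal Python illustration, assuming bags are given as a list of sets and the tree as a list of index pairs; the names `is_tree_decomposition` and `width` are introduced here and do not appear in the paper. The code assumes, without checking, that `tree_edges` really forms a tree.

```python
def is_tree_decomposition(vertices, edges, bags, tree_edges):
    """Check properties (i)-(iii) for bags (list of sets) joined by tree_edges.

    tree_edges is a list of pairs of indices into bags and is assumed
    (not verified) to form a tree.
    """
    # (i) the bags cover exactly the vertex set
    if set().union(*bags) != set(vertices):
        return False
    # (ii) every edge lies inside some bag
    if not all(any({u, v} <= bag for bag in bags) for u, v in edges):
        return False
    # (iii) the bags containing each vertex form a connected subtree
    for v in vertices:
        holders = {i for i, bag in enumerate(bags) if v in bag}
        start = next(iter(holders))
        seen, stack = {start}, [start]
        while stack:
            i = stack.pop()
            for a, b in tree_edges:
                if i in (a, b):
                    j = b if i == a else a
                    if j in holders and j not in seen:
                        seen.add(j)
                        stack.append(j)
        if seen != holders:
            return False
    return True

def width(bags):
    """Width of a decomposition: largest bag size minus one."""
    return max(len(bag) for bag in bags) - 1

# Example (assumed): the 4-cycle 0-1-2-3-0 has a decomposition of width 2.
cycle = [(0, 1), (1, 2), (2, 3), (3, 0)]
good_bags = [{0, 1, 2}, {0, 2, 3}]
ok = is_tree_decomposition(range(4), cycle, good_bags, [(0, 1)])
```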



1.1 Results 



We present the first algorithm for general demand Sparsest-Cut that achieves a constant factor
approximation for graphs of bounded treewidth $r$ (the restriction is only on the structure of the
graph, not the demands). Such an algorithm is conjectured to exist by [GNRS04] (they actually
make a stronger conjecture, see Section 1.2 for details). However, previously such an algorithm was
not known even for $r = 3$, although several algorithms are known for $r = 2$ [GNRS04, CJLV08,
CSW10] and for bounded-pathwidth graphs [LS09] (which is a subfamily of bounded-treewidth
graphs).

Theorem 1.1. There is an algorithm for Sparsest-Cut (general demands) on graphs of treewidth $r$,
that runs in time $(2^r n)^{O(1)}$ and achieves approximation factor $C = C(r)$ (independently of $n$, the
size of the graph).

Table 1 lists the best approximation algorithms known for various special cases of Sparsest-Cut.
We remark that the problem (with general demands) is NP-hard even for pathwidth 2; we include
a proof of this fact in Appendix A for the sake of completeness, as it is unclear whether this has
appeared previously in the literature.



Demands   Graphs                  Approximation                 Based on              Reference
general   arbitrary               $\tilde{O}(\sqrt{\log |D|})$  SDP                   [ALN08]
          treewidth 2             2                             LP (flow)             [GNRS04, CJLV08]
          fixed outerplanarity    $O(1)$                        LP (integer flow)     [CGN+06, CSW10]
          excluding $W_4$-minor   $O(1)$                        LP (flow)             [CJLV08]
          fixed pathwidth         $O(1)$                        LP (flow)             [LS09]
          fixed treewidth         $O(1)$                        LP (lifted)           This work
uniform   arbitrary               $O(\sqrt{\log n})$            SDP                   [ARV09]
          excluding fixed-minor   $O(1)$                        LP (flow)             [KPR93, proq]
          fixed treewidth         $O(1)$                        LP (flow)             [Rab03, CKS09]
          fixed treewidth         1                             dynamic programming



Table 1: Approximation algorithms for Sparsest-Cut. 

Techniques. Similarly to almost all previous work, our algorithm is based on rounding a linear
programming (LP) relaxation of the problem. A unique feature of our algorithm is that it employs
an LP relaxation derived from the hierarchy of (increasingly stronger) LPs, designed by Sherali and
Adams [SA90]. Specifically, we use level $r + O(1)$ of this hierarchy. In contrast, all prior work on
Sparsest-Cut uses either the standard LP (that arises as the dual of the concurrent-flow problem, see
e.g. [LR99]), or its straightforward strengthening to a semidefinite program (SDP). Consequently,
the entire setup changes significantly (e.g. the known connections to embeddings and flow, see
Section 1.2), and we face the distinctive challenges of exploiting the complex structure of these
relaxations (see Section 1.3).

While bounding the integrality gap of the standard LP (the flow-cut gap) for various graph families
remains an important open problem with implications in metric embeddings (see Section 1.2),
our focus is on directly approximating Sparsest-Cut. Accordingly, our LP is larger and (possibly
much) stronger than the standard flow LP, and hence our rounding does not imply a bound on the
flow-cut gap (akin to rounding of the SDP relaxation in [ARV09, CGR08, ALN08]).



Finally, note that the running time stated in Theorem 1.1 is much better than the $n^{O(r)}$ running
time typically needed to solve the $r + O(1)$ level of Sherali-Adams (or any other hierarchy). The
reason is that only $O(3^r n |D|)$ of the Sherali-Adams variables and constraints are really needed for
our analysis to go through (see Remark 3.2), thus greatly improving the time needed to solve the
LP. As the rounding algorithm we use is a simple variant of the standard method of randomized
rounding for LPs (adapted for Sherali-Adams relaxations on bounded-treewidth graphs), the entire
algorithm is both efficient and easily implementable.

1.2 The GNRS excluded-minor conjecture 



Gupta, Newman, Rabinovich and Sinclair (GNRS) conjectured in [GNRS04] that metrics supported
on graphs excluding a fixed minor embed into $\ell_1$ with distortion $O(1)$ (i.e. independent of the graph
size). By the results of [LLR95, AR98, GNRS04], this conjecture is equivalent to saying that in all
such graphs (regardless of the capacities and demands), the ratio between the sparsest-cut and the
concurrent-flow, called the flow-cut gap, is bounded by $O(1)$. Since the concurrent-flow problem is
polynomial-time solvable (e.g. by linear programming), the conjecture would immediately imply
that Sparsest-Cut admits $O(1)$ approximation (in polynomial-time) on these graphs.

Despite extensive research, the GNRS conjecture is still open, even in the special cases of planar
graphs and of graphs of treewidth 3. The list of special cases that have been resolved includes graphs
of treewidth 2, $O(1)$-outerplanar graphs, graphs excluding a 4-wheel minor, and bounded-pathwidth
graphs; see Table 1, where the flow LP is mentioned.

Our approximation algorithm may be interpreted as evidence supporting the GNRS conjecture 
(for graphs of bounded treewidth), since by the foregoing discussion, the conjecture being true would 
imply the existence of such approximation algorithms, and moreover that our LP's integrality gap 
is bounded. In fact, one consequence of our algorithm and its analysis can be directly phrased in 
the language of metric embeddings: 

Corollary 1.2. For every $r$ there is some constant $C = C(r)$ such that every shortest-path metric
on a graph of treewidth at most $r$, for which every set of size $r + 3$ is isometrically embeddable into $L_1$ in a
locally consistent way (i.e. the embeddings of two such sets, when viewed as probability distributions
over cuts, are consistent on the intersection of the sets), can be embedded into $L_1$ with distortion
at most $C$.

If, on the other hand, the GNRS conjecture is false, then our algorithm (and its stronger
LP) gives a substantial improvement over techniques using the flow LP, and may have surprising
implications for the Sherali-Adams hierarchy (see Section 1.3). Either way, our result opens up
several interesting questions, which we discuss in Section 1.4.



1.3 Related work 

Relaxation hierarchies and approximation algorithms. A research plan that has attracted
a lot of attention in recent years is the use of lift-and-project methods to design improved approximation
algorithms for NP-hard optimization problems. These methods, such as Sherali-Adams [SA90],
Lovász-Schrijver [LS91], and Lasserre [Las02] (see [Lau03] for a comparison), systematically generate,
for a given {0, 1} program (which can capture many combinatorial optimization problems, e.g.
Vertex-Cover), a sequence (aka hierarchy) of increasingly stronger relaxations. The first relaxation
in this sequence is often a commonly-used LP relaxation for that combinatorial problem. After $n$
steps (which are often called rounds or levels), the sequence converges to the convex hull of the integral
solutions, and the $k$-th relaxation in the sequence is a convex program (LP or SDP) that can
be solved in time $n^{O(k)}$. Therefore, the first few, say $O(1)$, relaxations in the sequence offer a great
promise to approximation algorithms: they could be much stronger than the commonly-used LP
relaxation, yet are polynomial-time computable. This is particularly promising for problems for
which there is a gap between known approximations and proven hardness of approximation (or
when the hardness relies on weaker assumptions than P $\ne$ NP).



Unfortunately, starting with the work of Arora, Bollobás, Lovász, and Tourlakis [ABLT06] on
Vertex-Cover, there has been a long line of work showing that for various problems, even after a large
(super-constant) number of rounds, various hierarchies do not yield smaller integrality gaps than
a basic LP/SDP relaxation (see, e.g. [STT07, GMPT07, Sch08, Tul09, CMM09]). In particular,
Raghavendra and Steurer [RS09] have recently shown that a superconstant number of rounds of
certain SDP hierarchies does not improve the integrality gap for any constraint satisfaction problem
(MAX-CSP).

In contrast, only few of the known results are positive, i.e. show that certain hierarchies
give a sequence of improvements in the integrality gap in their first $O(1)$ levels; this has
been shown for Vertex-Cover in planar graphs [MM09], Max-Cut in dense graphs [dlVK07],
Knapsack [KMN09, Bie08], and Maximum Matching [MS09]. There are even fewer results where the
improved approximation is the state-of-the-art for the respective problem; such results include
recent work on Chromatic Number [Chl07], Hypergraph Independent Set [CS08], and MaxMin
Allocation [BCG09].



In the context of bounded-treewidth graphs, a bounded number of rounds in the Sherali-Adams 
hierarchy is known to be tight (i.e. give exact solutions) for many problems that are tractable 
on this graph family, such as CSPs [WJ04]. This is only partially true for Sparsest-Cut — due to 
the exact same reason, we easily find in the graph a cut whose edge capacity exactly matches the 
corresponding expression in the LP. However, the demands are arbitrary (and in particular do not 
have a bounded-treewidth structure), and analyzing them requires considerably more work. 



Hardness and integrality gaps for sparsest-cut. As mentioned earlier, Sparsest-Cut is known
to be NP-hard [MS86], and we further show in Appendix A that it is even NP-hard on graphs of
pathwidth 2. Two results [KV05, CKK+06] independently proved that under Khot's unique games
conjecture [Kho02], the Sparsest-Cut problem is NP-hard to approximate within any constant factor.
However, the graphs produced by the reductions in these two results have large treewidth.



The standard flow LP relaxation for Sparsest-Cut was shown in [LR99] to have integrality gap
$\Omega(\log n)$ in expander graphs, even for uniform demands. Its standard strengthening to an SDP
relaxation (the SDP used by the known approximation algorithms of [ARV09, ALN08]) was shown
in [KV05, KR09, DKSV06] to have integrality gap $\Omega(\log \log n)$, even for uniform demands. For the
case of general demands, a stronger bound $(\log n)^{\Omega(1)}$ was recently shown in [CKN09]. Some of
these results were extended in [CMM09, RS09] to certain hierarchies and a nontrivial number of
rounds, even for uniform demands. Again, the graphs used in these results have large treewidth.

Integrality gaps for graphs of treewidth $r$ (or excluding a fixed minor of size $r$) follow from
the above in the obvious way of replacing $n$ with $r$ (or so), for instance, the standard flow LP has
integrality gap $\Omega(\log r)$. However, no stronger gaps are known for these families; in particular, it is
possible that the integrality gap approaches 1 with sufficiently many rounds (depending on $r$, but
not on $n$).



1.4 Discussion and further questions 

We show that for the Sparsest-Cut problem, the Sherali-Adams (SA) LP hierarchy can yield algo- 
rithms with better approximation ratio than previously known. Moreover, our analysis exhibits a 
strong (but rather involved) connection between the input graph's treewidth and the SA hierarchy 
level. Several interesting questions arise immediately: 

1. Can this approach be generalized to excluded-minor graphs? 

2. Can the approximation factor be improved to an absolute constant (independent of the 
treewidth)? 

A particularly intriguing and more fundamental question is whether this hierarchy (or a related 
one, or for a different input family) is strictly stronger than the standard LP (or SDP) relaxation. 
One possibility is that our relaxation can actually yield an absolute constant factor approximation 
(as in Question 2). Such an approximation factor is shown in [CMM09] to require at least $\Omega(\log r)$
rounds of Sherali-Adams, and we would conclude that hierarchies yield strict improvement: higher
(yet constant) levels of the Sherali-Adams hierarchy do give improved approximation factors, for an
increasing sequence of graph families. We note, however, that this would require a different rounding
algorithm (see Remark 4.2 and Section 7). Another possibility is that the GNRS conjecture does
not hold even for bounded treewidth graphs, in which case the integrality gap of the standard LP 
exhibits a dependence on n, while, as we prove here, the stronger LP does not. 



2 Technical Overview 

Relaxations arising from the Sherali-Adams (SA) hierarchy, and lift-and-project techniques in general,
are known to give LP (or SDP) solutions which satisfy the following property: for every
subset of variables of bounded size (bounded by the level in the hierarchy used), the LP/SDP
solution restricted to these variables is a convex combination of valid {0, 1} assignments. Such a
convex combination can naturally be viewed as a distribution on local assignments. In our case,
for example, in an induced subgraph on $r + 1$ vertices $S$, an $(r + 1)$-level relaxation gives a local
distribution on assignments $f : S \to \{0, 1\}$ such that for every edge $(i, j)$ within $S$, the probability
that $f(i) \ne f(j)$ is exactly the contribution of edge $(i, j)$ to the objective function (which we also
call the LP-distance of this pair). Our algorithm makes explicit use of this property, which is very
useful for treewidth $r$ graphs.

Given an (r + 3)-level Sherali-Adams relaxation, for every demand pair there is some distri- 
bution which (within every bag) matches the local distributions suggested by the LP, and also 
cuts/separates this demand pair (i.e. assigns different values to its endpoints) with the correct 
probability (the LP distance). Unfortunately, there might not be any single distribution which is 
consistent with all demand pairs, so instead our algorithm assigns {0, 1} values at random to the 
vertices of the graph G in a stochastic process which matches the local distributions suggested by 
the LP solution (per bag), but is oblivious to the structure of the demands D. 



Intuition. To achieve a good approximation ratio, it suffices to ensure that every demand pair
is cut with probability not much smaller than its LP distance. To achieve this, the algorithm
fixes an arbitrary bag as the root, and traverses the tree decomposition one bag at a time, from
the root towards the leaves, and samples the assignment to currently unassigned vertices in the
current bag. This assignment is sampled in a way that ignores all previous assignments to vertices
outside the current bag, but achieves the correct distribution on assignments to the current bag.
Essentially, the algorithm finds locally correct distributions while maximizing the entropy of the
overall distribution. Intuitively, this should "distort" the distribution suggested by the LP (for
a given demand pair) only by introducing noise, which (if the noise is truly unstructured) mixes the
correct global distribution with a completely random one in which every two vertices are separated
with probability 1/2. In this case, the probability of separating any demand pair would decrease by
at most a factor 2. Unfortunately, we are not able to translate this intuition into a formal proof
(and on some level, it is not accurate; see Remark 4.2). Thus we are forced to adopt a different
strategy in analyzing the performance of the rounding algorithm. Let us see one illustrative special
case.



Example: Simple Paths. Consider, for concreteness, the case of a single simple path $v_1, v_2, \ldots, v_n$.
For every edge $(v_{i-1}, v_i)$ in the path, the LP suggests cutting it (assigning different values) with
some probability $p_i$. Our algorithm will perform the following Markov process: pick some assignment
$f(v_1) \in \{0, 1\}$ at random according to the LP, and then, at step $i$ (for $i = 2, \ldots, n$) look only
at the assignment $f(v_{i-1})$ and let $f(v_i) = 1 - f(v_{i-1})$ with probability $p_i$, and $f(v_i) = f(v_{i-1})$ otherwise.
Each edge has now been cut with exactly the probability corresponding to its LP distance.
However, for $(v_1, v_n)$, which could be a demand pair, the LP distance between them might be much
greater than the probability $q_n = \Pr[f(v_1) \ne f(v_n)]$. Let us see that the LP distance can only be
a constant factor more.

First, if the above probability satisfies $q_n \ge 1/3$, then clearly we are done, as all LP distances will
be at most 1. Thus we may assume that $q_n < 1/3$. Let us examine what happens at a single step.
Suppose the algorithm has separated $v_1$ from $v_{i-1}$ with some probability $q_{i-1} \le 1/3$ (assuming that
all $q_i \le 1/3$ is a somewhat stronger assumption than $q_n \le 1/3$, but a more careful analysis shows it
is also valid). After the current step (flipping sides with probability $p_i$), the probability that $v_i$ is
separated from $v_1$ is exactly $(1 - q_{i-1}) p_i + q_{i-1} (1 - p_i)$. This is an increase over the previous value
of at least

$$[(1 - q_{i-1}) p_i + q_{i-1} (1 - p_i)] - q_{i-1} = (1 - 2 q_{i-1}) p_i \ge p_i / 3.$$

However, the LP distance from $v_1$ can increase by at most $p_i$ (by triangle inequality). Thus, we
can show inductively that we never lose more than a factor 3.
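
The recursion in this example is easy to evaluate numerically. The Python sketch below computes the separation probabilities $q_i$ for a path and compares $q_n$ with the triangle-inequality upper bound $\min(1, \sum_i p_i)$ on the LP distance. The function name `separation_probabilities` and the sample values are assumptions made for illustration; the comparison uses only the upper bound, not the LP distance of an actual LP solution.

```python
def separation_probabilities(p):
    """Return [q_1, ..., q_n] for flip probabilities p = [p_2, ..., p_n].

    q_i is the probability that v_i is assigned a different value than v_1
    in the Markov process of the path example, so q_1 = 0.
    """
    q = [0.0]
    for p_i in p:
        prev = q[-1]
        # v_i is separated from v_1 if v_{i-1} was not separated and we flip,
        # or v_{i-1} was separated and we do not flip
        q.append((1 - prev) * p_i + prev * (1 - p_i))
    return q

# Hypothetical edge probabilities on a path with 6 vertices.
p = [0.1] * 5
q = separation_probabilities(p)
lp_upper = min(1.0, sum(p))  # triangle-inequality bound on the LP distance
loss = lp_upper / q[-1]      # how much smaller q_n can be than the bound
```

For these values the loss is roughly 1.49, comfortably below the factor 3 obtained in the argument above.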

In general, our analysis will consider paths of bags of size r + 1. Even though we can still 
express the distribution on assignments chosen by the rounding algorithm as a Markov process 
(where the possible states at every step will be assignments to some set of at most r vertices), it 
will be less straightforward to relate the LP values to this process. It turns out that we can get 
a handle on the LP distances by modeling the Markov process as a layered digraph $H$ with edge
capacities representing the transitions (this is only in the analysis, or in the derandomization of our
algorithm). In this case the LP distance we wish to bound becomes the value of a certain (s, t)-flow 
in H. We then bound the flow-value from above by finding a small cut in H. Constructing and 
bounding the capacity of such a cut in H constitutes the technical core of this work. 

3 The Algorithm 

3.1 An LP relaxation using the Sherali Adams hierarchy 

Let us start with an informal overview of the Sherali-Adams (SA) hierarchy. In an LP relaxation
for a 0-1 program, the linear variables $\{y_i \mid i \in [n]\}$ represent linear relaxations of integer variables
$x_i \in \{0, 1\}$. We can extend such a relaxation to include variables $\{y_I\}$ for larger subsets $I \subseteq [n]$
(usually, up to some bounded cardinality). These should be interpreted as representing the products
$\prod_{i \in I} x_i$ in the intended (integer) solution. Now, for any pair of sets $I, J \subseteq [n]$, we will denote by
$y_{I,J}$ the linear relaxation for the polynomial $\prod_{i \in I} (1 - x_i) \prod_{j \in J} x_j$. These can be derived from the
variables $y_I$ by the inclusion-exclusion principle. That is, we define

$$y_{I,J} = \sum_{I' \subseteq I} (-1)^{|I'|} y_{I' \cup J}.$$

The constraints defined by the polytope $SA_t(n)$, that is, level $t$ of the Sherali-Adams hierarchy
starting from the trivial $n$-dimensional LP, are simply the inclusion-exclusion constraints:

$$\forall I, J \subseteq [n] \text{ s.t. } |I \cup J| \le t : \quad y_{I,J} \ge 0. \qquad (1)$$

For every solution other than the trivial (all-zero) solution, we can define a normalized solution
$\{\tilde{y}_I\}$ as follows:

$$\tilde{y}_I = y_I / y_\emptyset,$$

and the normalized derived variables $\tilde{y}_{I,J}$ can be similarly defined.
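
The inclusion-exclusion formula can be sanity-checked on an integral solution, where $y_I$ is simply the probability that all variables in $I$ equal 1. The following Python sketch does this for a small, made-up distribution; the helper names `moments` and `derived` and the distribution itself are assumptions for illustration, and here $y_\emptyset = 1$ so no normalization is needed.

```python
from itertools import combinations

def moments(dist, n):
    """y_I = Pr[x_i = 1 for all i in I], for every subset I of range(n).

    dist maps 0/1 tuples of length n to probabilities.
    """
    y = {}
    for size in range(n + 1):
        for I in combinations(range(n), size):
            y[frozenset(I)] = sum(pr for x, pr in dist.items()
                                  if all(x[i] == 1 for i in I))
    return y

def derived(y, I, J):
    """y_{I,J}: sum over subsets I' of I of (-1)^{|I'|} * y_{I' union J}."""
    I, J = frozenset(I), frozenset(J)
    total = 0.0
    for size in range(len(I) + 1):
        for sub in combinations(sorted(I), size):
            total += (-1) ** size * y[J | frozenset(sub)]
    return total

# Hypothetical distribution over assignments to three variables.
dist = {(0, 0, 1): 0.5, (1, 0, 1): 0.25, (1, 1, 0): 0.25}
y = moments(dist, 3)

# Probability that variables 0 and 1 receive different values.
cut_01 = derived(y, {0}, {1}) + derived(y, {1}, {0})
```

Here `derived(y, {0, 1}, {2})` recovers the probability that the first two variables are 0 and the third is 1, which is 0.5 for this distribution.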

As is well-known, in a non-trivial level $t$ Sherali-Adams solution, for every set of (at most) $t$
vertices, constraints (1) imply a distribution on {0, 1} assignments to these vertices matching the
LP values:

Lemma 3.1. Let $\{y_I\}$ be a non-zero vector in the polytope $SA_t(n)$. Then for every set $L \subseteq [n]$
of cardinality $|L| \le t$, there is a distribution $\mu_L$ on assignments $f : L \to \{0, 1\}$ such that for all
$I, J \subseteq L$,

$$\Pr_{f \sim \mu_L} [(\forall i \in I : f(i) = 0) \wedge (\forall j \in J : f(j) = 1)] = \tilde{y}_{I,J}.$$

In a Sparsest Cut relaxation, we are interested in the event in which a pair of vertices is cut
(i.e. assigned different values). This is captured by the following linear variable:

$$y_{i \not\sim j} = y_{\{i\},\{j\}} + y_{\{j\},\{i\}}.$$
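
We can now define our relaxation for Sparsest Cut, $SC_r(G)$:

$$\begin{aligned}
\min \quad & \sum_{(i,j) \in E} \mathrm{cap}(i,j)\, y_{i \not\sim j} && (2) \\
\text{s.t.} \quad & \sum_{(i,j) \in D} \mathrm{dem}(i,j)\, y_{i \not\sim j} = 1 && (3) \\
& \{y_I\} \in SA_{r+3}(n) && (4) \\
& y_{I,J} = y_{J,I} \quad \forall I, J \text{ s.t. } |I \cup J| \le r + 3 && (5)
\end{aligned}$$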

Note that constraint (3) is simply a normalization ensuring that the objective function is really
a relaxation for the ratio of the two sums. Also note that constraint (5), which ensures that the
LP solution is fully symmetric, does not strengthen the LP, in the following sense: For any solution
$\{y'_I\}$ to the above LP without constraint (5), a new solution to the symmetric LP (with the same
value in the objective function) can be achieved by taking $y_I = (y'_I + y'_{I,\emptyset})/2$ without violating any
of the other constraints. In particular, for every vertex $i$ this gives $\tilde{y}_{\{i\}} = 1 - \tilde{y}_{\{i\}} = 1/2$. While our
results hold true without imposing this constraint, we will retain it as it simplifies our analysis.
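
The effect of the symmetrization can be seen on a distribution over cuts, which is the integral analogue of an LP solution. The Python sketch below averages a distribution with its bitwise complement and confirms that cut probabilities are unchanged while every vertex marginal becomes 1/2. The function names and the example distribution are assumptions for illustration; the sketch operates on explicit distributions, not on fractional Sherali-Adams solutions.

```python
def symmetrize(dist):
    """Average a distribution over 0/1 tuples with its bitwise complement."""
    out = {}
    for x, pr in dist.items():
        comp = tuple(1 - b for b in x)
        out[x] = out.get(x, 0.0) + pr / 2
        out[comp] = out.get(comp, 0.0) + pr / 2
    return out

def cut_prob(dist, i, j):
    """Probability that positions i and j receive different values."""
    return sum(pr for x, pr in dist.items() if x[i] != x[j])

def marginal(dist, i):
    """Probability that position i is assigned 1."""
    return sum(pr for x, pr in dist.items() if x[i] == 1)

# Hypothetical, non-symmetric distribution over assignments to 3 vertices.
dist = {(0, 0, 1): 0.5, (1, 0, 1): 0.25, (1, 1, 0): 0.25}
sym = symmetrize(dist)
```

Complementing an assignment never changes which pairs are separated, which is why the objective value is preserved.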

Remark 3.2. The size of this LP (and the time needed to solve it) is $n^{O(r)}$. Specifically for bounded-treewidth
graphs, we could also formulate a much smaller LP, where constraint (4) would be replaced
with the condition $\{y_I \mid I \subseteq B \cup \{i, j\}\} \in SA_{r+3}(r + 3)$ for every bag $B$ and demand pair $(i, j) \in D$.
This would reduce the size of the LP (and the time needed to solve it) to at most $\mathrm{poly}(2^r n)$, and
our rounding algorithm and analysis would still hold.



3.2 Rounding the LP 

Before we present the rounding algorithm, let us introduce some notation which will be useful in
describing the algorithm. This notation will allow us to easily go back-and-forth between the LP
solution and the local distributions on assignments described in Lemma 3.1. For ease of notation,
whenever two functions $f_1, f_2$ have disjoint domains, we will denote by $f_1 \cup f_2$ the unique function
from the union of the domains which is an extension of both $f_1$ and $f_2$.

• For every set of vectors $\{y_I\}$ and subset $L \subseteq [n]$ as in Lemma 3.1, we will denote by $\mu_L^{\{y_I\}}$
the distribution on random assignments to $L$ guaranteed by the lemma. We will omit the
superscript $\{y_I\}$, and simply write $\mu_L$, when it is clear from the context.

• Conversely, for any fixed assignment $f' : L \to \{0, 1\}$, we will write $y_{f'} = y_{L_0, L_1}$, where
$L_b = \{i \in L \mid f'(i) = b\}$ for $b = 0, 1$. Thus, for a random assignment $f : L \to \{0, 1\}$
distributed according to $\mu_L$, we have $\Pr[f = f'] = \tilde{y}_{f'}$.

• For any nonempty subset $L' \subseteq L$, and a given assignment $f_0 : L \setminus L' \to \{0, 1\}$ in the support
of $\mu_{L \setminus L'}$, we will denote by $\mu_{L', f_0}$ the distribution on random assignments $f \sim \mu_L$ conditioned
on the partial assignment $f_0$. Formally, a random assignment $f' : L' \to \{0, 1\}$ distributed
according to $\mu_{L', f_0}$ satisfies $\Pr_{f'}[f' = f_1] = y_{f_0 \cup f_1} / y_{f_0}$ for every choice of $f_1 : L' \to \{0, 1\}$.

Let $G$ be a graph with treewidth $r$ for some integer $r > 0$, and let $(\mathcal{B}, T)$ be the corresponding
tree decomposition. Let $\{y_I\}$ be a vector satisfying $SC_r(G)$. We now present the rounding
algorithm:

Algorithm SC-Round($G$, $(\mathcal{B}, T)$, $\{y_I\}$) [Constructs a random assignment $f : V \to \{0, 1\}$]

1. Pick an arbitrary bag $B_0 \in \mathcal{B}$ as the root of $T$, and sample $f|_{B_0}$ according to $\mu_{B_0}$.

2. Traverse the rest of the tree $T$ in any order from the root towards the leaves. For
each bag $B$ traversed, do the following:

(a) Let $B^+$ be the set of vertices in $B$ for which $f$ is already defined, and let
$B^- = B \setminus B^+$. Let $f_0$ be the corresponding assignment $f_0 = f|_{B^+}$.

(b) If $B^-$ is non-empty, sample $f|_{B^-}$ at random according to $\mu_{B^-, f_0}$.
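
The steps above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not a reference implementation: the local distributions $\mu_B$ are passed in explicitly as dictionaries (in the algorithm they would be derived from the LP solution), and the names `sc_round`, `bags`, `tree_adj` and `local` are introduced here. Conditioning on $f_0$ is implemented by discarding the bag assignments that disagree with already fixed values and sampling proportionally among the rest.

```python
import random

def sc_round(bags, tree_adj, local, root=0, rng=random):
    """Sketch of SC-Round on a tree decomposition.

    bags[b]     : tuple of the vertices of bag b
    tree_adj[b] : bags adjacent to b in the tree T
    local[b]    : {tuple of 0/1 values aligned with bags[b]: probability},
                  a stand-in for the LP-derived distribution mu_B
    Returns a dict f mapping every vertex to 0 or 1.
    """
    # Step 2 ordering: a connected traversal (BFS) from the root.
    order, seen, idx = [root], {root}, 0
    while idx < len(order):
        for c in tree_adj[order[idx]]:
            if c not in seen:
                seen.add(c)
                order.append(c)
        idx += 1

    f = {}
    for b in order:
        verts = bags[b]
        # Keep assignments that agree with the values fixed so far (f_0 on B^+).
        consistent = [(a, pr) for a, pr in local[b].items()
                      if pr > 0 and all(f.get(v, bit) == bit
                                        for v, bit in zip(verts, a))]
        if not consistent:
            raise ValueError("local distributions are not consistent")
        assignments, weights = zip(*consistent)
        # Proportional sampling among them is sampling from mu_{B^-, f_0}.
        chosen = rng.choices(assignments, weights=weights, k=1)[0]
        f.update(zip(verts, chosen))
    return f
```

For the root bag nothing is fixed yet, so the same code path samples $f|_{B_0}$ from $\mu_{B_0}$; passing a seeded `random.Random` instance as `rng` makes runs reproducible.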



Let us first see that every edge $(i, j) \in E$ is cut with probability exactly $\tilde{y}_{i \not\sim j}$. Since every edge
is contained in at least one bag, it suffices to show that within every bag $B$, the assignment $f|_B$ is
distributed according to $\mu_B$. This is shown by the following straightforward lemma.

Lemma 3.3. For every bag $B$, the assignment $f|_B$ produced by algorithm SC-Round($G$, $(\mathcal{B}, T)$, $\{y_I\}$)
is distributed according to $\mu_B$.

Proof. We show this by induction. For $B_0$ this holds as the assignment is explicitly sampled
according to this distribution.

Now, let $B$ be a new bag traversed and $B^+$ and $B^-$ as in Step 2a. By the definition of a
tree decomposition, and since the tree traversal maintains a single connected component, $B^+$ must
be fully contained in some bag $B'$ which has already been traversed. Thus, by the inductive
hypothesis, $f|_{B'}$ is distributed according to $\mu_{B'}$, and in particular, $f|_{B^+}$ is distributed according to
$\mu_{B^+}$. Note that this is also the marginal distribution of assignments to $B^+$ according to $\mu_B$. Thus,
the assignment to $B^+$ must lie in the support of $\mu_{B^+}$ (this shows that Step 2b is well defined),
and for every such fixed assignment $f_0$, and every fixed assignment $f_1 : B^- \to \{0, 1\}$, we have

$$\begin{aligned}
\Pr[f|_B = f_0 \cup f_1] &= \Pr[f|_{B^+} = f_0] \cdot \Pr[f|_{B^-} = f_1 \mid f|_{B^+} = f_0] \\
&= \Pr_{f' \sim \mu_B}[f'|_{B^+} = f_0] \cdot \Pr_{f' \sim \mu_{B^-, f_0}}[f' = f_1] \\
&= \Pr_{f' \sim \mu_B}[f'|_{B^+} = f_0] \cdot \Pr_{f' \sim \mu_B}[f'|_{B^-} = f_1 \mid f'|_{B^+} = f_0] \\
&= \Pr_{f' \sim \mu_B}[f'|_B = f_0 \cup f_1].
\end{aligned}$$

□

This lemma shows that the expected value of the cut is $\sum_{(i,j) \in E} \mathrm{cap}(i,j)\, \tilde{y}_{i \not\sim j}$, which is exactly
the value of the objective function (2) scaled by $1/y_\emptyset$. In particular, for a host of other problems
where the objective function and constraints depend only on the edges (e.g. Minimum Vertex
Cover, Chromatic Number), this type of LP relaxation (normalized by setting $y_\emptyset = 1$), along with
the above rounding, always produces an optimal solution for bounded-treewidth graphs. Thus, in
some sense, we consider this to be a "natural" rounding algorithm.

Before we analyze the expected value of the cut demands (or specifically, the probability that 
each demand is cut), let us show that the order in which the tree T is traversed has no effect on 
the distribution of cuts produced (it will suffice to show a slightly weaker claim - that the joint 
distribution of cuts in any two bags is not affected). This is shown in the following lemma. 

Lemma 3.4. Let $B_1, B_2 \in \mathcal{B}$ be two arbitrary bags. Then the distribution on assignments $f|_{B_1 \cup B_2}$
is invariant under any connected traversal of $T$.

Proof. Since the set of traversed bags is always a connected component, it suffices to consider only
the tree-path connecting $B_1$ and $B_2$. Let us proceed by induction on the length of the path. If
$B_1$ and $B_2$ are adjacent bags, then regardless of the order in which they are traversed, for any
fixed assignment $f_0 : B_1 \cap B_2 \to \{0, 1\}$, if $f|_{B_1 \cap B_2} = f_0$, then $f|_{B_1 \setminus B_2}$ and $f|_{B_2 \setminus B_1}$ are distributed
according to $\mu_{B_1 \setminus B_2, f_0}$ and $\mu_{B_2 \setminus B_1, f_0}$, respectively. Moreover, these two assignments are independent
(after conditioning on $f|_{B_1 \cap B_2} = f_0$) regardless of the order of traversal. Thus, in either ordering,
the same distribution on cuts can be achieved by first sampling $f|_{B_1 \cap B_2}$ according to $\mu_{B_1 \cap B_2}$,
and then sampling $f|_{B_1 \setminus B_2}$ and $f|_{B_2 \setminus B_1}$ independently according to the above distributions, where
$f_0 = f|_{B_1 \cap B_2}$.

Now suppose that bags $B_1$ and $B_2$ are not adjacent, and let $B_1'$ be the bag adjacent to $B_1$ on
the path to $B_2$. By the inductive hypothesis, the distribution on $f|_{B_1' \cup B_2}$ is invariant under the
order in which the path between $B_1'$ and $B_2$ is traversed. In particular, this is true for the distribution
on $f|_{(B_1 \cap B_1') \cup (B_2 \setminus B_1)}$; call it $\mathcal{T}'$. Thus, arguing as above, we see that any ordering results
in the distribution on $f|_{B_1 \cup B_2}$ obtained by first sampling $f|_{B_1 \cap B_1'}$ according to $\mu_{B_1 \cap B_1'}$, and then
sampling $f|_{B_1 \setminus B_1'}$ according to $\mu_{B_1 \setminus B_1', f|_{B_1 \cap B_1'}}$, and then independently sampling $f|_{B_2 \setminus B_1}$ according to
the distribution $\mathcal{T}'$ conditioned on the value of $f|_{B_1 \cap B_1'}$. □



4 Markov Flow Graphs 



In this section and the next two, we shall show the following lemma, which together with Lemma 3.3
implies Theorem 1.1 (see Remark 4.3).

Lemma 4.1. For every integer $r > 0$ there exists a constant $c_r > 0$ such that for any treewidth-$r$
graph $G$ with tree decomposition $(\mathcal{B}, T)$, and vectors $\{y_I\}$ satisfying $SC_r(G)$, algorithm
SC-Round($G$, $(\mathcal{B}, T)$, $\{y_I\}$) outputs a random $f : V \to \{0, 1\}$ s.t. for every $i, j \in V$,

$$\Pr[f(i) \ne f(j)] \ge c_r \, \tilde{y}_{i \not\sim j}. \qquad (6)$$



Remark 4.2. The constant Cr arising in our analysis is quite small (roug we 
believe this can be improved, we cannot eliminate the dependence on $r$, as a lower bound on the
performance of our rounding algorithm (see Section 7) shows that $c_r$ cannot be more than $2^{-r/2}$.

Remark 4.3. In fact, Lemmas 3.3 and 4.1 taken together show the following: Given any solution to
$SC_r(G)$ with objective function value $\alpha > 0$, algorithm SC-Round produces a random assignment
$f$ satisfying

$$\mathbb{E}\Big[\sum_{(i,j)\in E} \mathrm{cap}(i,j)\,|f(i)-f(j)| \;-\; \frac{\alpha}{c_r}\sum_{(i,j)\in D} \mathrm{dem}(i,j)\,|f(i)-f(j)|\Big] \le 0.$$



This means the algorithm produces a $1/c_r$-approximation with positive probability, but does not
immediately imply a lower bound on that probability. Fortunately, following the analysis in this
section, the algorithm can be derandomized by the method of conditional expectations, since, at
each step, finding the probability of separating each demand pair reduces to calculating the
probability of reaching a certain state at a certain phase in some Markov process, which simply involves
multiplying $O(n)$ transition matrices of size at most $2^r \times 2^r$ (in fact, these can be consolidated
so that every step of the algorithm involves a total of $O(n|T|)$ small matrix multiplications for all
demands combined, where $T$ is the set of vertices participating in demand pairs).

For vertices $i,j \in V$ belonging to (at least) one common bag, Lemma 3.3 implies equality in (6)
for $c_r = 1$. For $i,j \in V$ which do not lie in the same bag, consider the path of bags $B_1, \ldots, B_N$ in
tree $T$ from the (connected) component of bags containing $i$ to the component of bags containing
$j$. By Lemma 3.4, we may assume that the algorithm traverses the path in order from $B_1$ to $B_N$.

To understand the event that vertices $i$ and $j$ are separated, it suffices to consider the following
incomplete (but consistent) description of the stochastic process involved: Let $S_0 = \{i\}$ and
$S_N = \{j\}$, and let $S_l = B_l \cap B_{l+1}$ for $l = 1, \ldots, N-1$. The algorithm assigns $f(i)$ a value in $\{0,1\}$
uniformly at random, and then for $l = 1, \ldots, N$, samples $f|_{S_l}$ from the distribution $\mu_{S_l,\, f|_{S_{l-1}}}$ (we
extend the definition of $\mu_{S,f'}$ in the natural way to include the case where $S$ may intersect the
domain of $f'$).
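
As a minimal sketch of the matrix computation mentioned in Remark 4.3, assume the transitions of the process above are available as explicit row-stochastic matrices (one per step). The function names and the toy two-state chain below are illustrative assumptions and are not part of algorithm SC-Round; exact rationals are used only to keep the arithmetic transparent.

```python
from fractions import Fraction

def mat_mul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(a[i][t] * b[t][j] for t in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def end_to_end(transitions):
    """Product of the transition matrices: entry [s][t] is Pr[X_N = t | X_0 = s]."""
    prod = transitions[0]
    for m in transitions[1:]:
        prod = mat_mul(prod, m)
    return prod

def separation_probability(transitions):
    """Pr[X_0 != X_N] when the first and last layers both have states {0, 1}
    and the initial state is uniform (as for f(i) in the process above)."""
    prod = end_to_end(transitions)
    half = Fraction(1, 2)
    return half * prod[0][1] + half * prod[1][0]

# Toy chain (an assumption for illustration): three identical symmetric steps,
# each flipping the current state with probability 1/4.
q = Fraction(1, 4)
step = [[1 - q, q], [q, 1 - q]]
print(separation_probability([step, step, step]))
```

In the setting of the remark the matrices would have up to $2^r$ rows and columns, and intermediate layers may have different sizes; the product above handles rectangular factors as long as consecutive dimensions match.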

This is a Markov process, and can be viewed as a Markov flow graph. That is, a layered graph,
where each layer consists of nodes representing the different states (in this case, assignments to
$S_l$), with exactly one unit of flow going from the first to the last layer, with all edges having flow
at full capacity. Since all edges in the flow graph represent pairs of assignments within the same
bag, Lemma 3.3 implies that the capacity of an edge (transition) $(f_1, f_2)$ is exactly $y_{f_1\cup f_2}$ and
the amount of flow going through each node $f_0$ is $y_{f_0}$. For the sake of clarity, we will refer to the
elements of $V$ as vertices and to the states in the flow graph as nodes.

We now would like to analyze the contribution of a demand pair to the LP. By constraint (5),
this contribution (up to a factor $\mathrm{dem}(i,j)$) is $y_{i\neq j} = 2y_{f^*}$, where $f^* : \{i,j\} \to \{0,1\}$
is the function assigning 0 to $i$ and 1 to $j$. Now consider a layered graph as above where each edge
$(f_1, f_2)$ has flow $y_{f^*\cup f_1\cup f_2}$. To see that this is indeed a flow, note that two consecutive layers along
with $i$ and $j$ only involve at most $r + 3$ vertices in $G$, and so by Lemma 3.1 for any $l > 0$ and function
$f_2 : S_l \to \{0,1\}$ the incoming flow at $f_2$ must be $\sum_{f_1} y_{f^*\cup f_1\cup f_2} = y_{f^*\cup f_2}$, and so is the outgoing
flow. The total flow in this graph is exactly $y_{f^*}$ (half the LP contribution $y_{i\neq j}$). Moreover, for each
such edge (transition) we also have $y_{f^*\cup f_1\cup f_2} \le y_{f_1\cup f_2}$. Hence, the flow with values $\{y_{f^*\cup f_1\cup f_2}\}$
is a legal flow respecting the capacities $\{y_{f_1\cup f_2}\}$ in the Markov flow graph which represents the
rounding algorithm.

Thus it suffices to show the following: 

Theorem 4.4. For every integer $k \ge 1$, there is a constant $C = C(k) > 0$ such that for any
symmetric Markov flow graph $G = (L_0, \ldots, L_N, E)$ representing a Markov process $X_0, \ldots, X_N$ with
sources $L_0 = \{s_0, s_1\}$ and sinks $L_N = \{t_0, t_1\}$ and at most $k$ nodes per layer, the total amount of
capacity-respecting flow in $G$ from $s_0$ to $t_1$ can be at most $C \cdot \Pr[X_0 = s_0 \wedge X_N = t_1]$.

Applying this theorem to the Markov flow graph described above with $k = 2^r$ immediately
implies Lemma 4.1. As usual, to bound the amount of flow in a graph from above, it suffices to
find a suitable cut, which is what we will do in the following section.



5 Bounding the Cut Size 

In this section we prove Theorem 4.4 for $k = 4$ (the proof for the general case appears in Section 6).
For Markov flow graph $G = (L_0, \ldots, L_N, E)$ and corresponding Markov process $X_0, \ldots, X_N$ as in
the theorem, for any integers $0 \le l_1 < l_2 \le N$ and any vertices $u \in L_{l_1}$, $v \in L_{l_2}$ we will let
$p(u) = \Pr[X_{l_1} = u]$ be the probability of reaching $u$, and similarly, we define
$p(u,v) = \Pr[X_{l_1} = u \wedge X_{l_2} = v]$. In particular, when $l_2 = l_1 + 1$ and $(u,v)$ is an edge (transition) then $p(u,v)$ is also
the capacity of this edge. Note that, by the symmetry of $G$, we have $p(s_0) = p(s_1) = \frac{1}{2}$.

5.1 A potential function for Markov flow graphs 



We will define a potential function on the layers of $G$, which will allow us to rephrase Theorem 1.1
in more convenient terms. First, for every layer $l$ and vertex $v \in L_l$, let us define

$$A(v) = \Pr[X_0 = s_0 \mid X_l = v] - \tfrac{1}{2}.$$

This function satisfies the following stochastic property: 

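Lemma 5.1. For any $0 \le l_1 < l_2$ and $v \in L_{l_2}$ we have

$$A(v) = \sum_{u \in L_{l_1}} \frac{p(u,v)}{p(v)}\, A(u).$$

Proof. By the Markov property, we have

$$A(v) + \tfrac12 = \Pr[X_0 = s_0 \mid X_{l_2} = v] = \frac{p(s_0,v)}{p(v)} = \frac{1}{p(v)}\sum_{u \in L_{l_1}} \frac{p(u,v)}{p(u)}\, p(s_0,u) = \sum_{u \in L_{l_1}} \frac{p(u,v)}{p(v)}\Big(A(u) + \tfrac12\Big),$$

and the claim follows since $\sum_{u \in L_{l_1}} p(u,v) = p(v)$. □

Now, for every layer $l = 0, \ldots, N$, let us define the following potential function:

$$\varphi(l) = \sum_{v \in L_l} p(v)\, A(v)^2.$$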



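The following lemma shows that this potential function is monotone decreasing in $l$, and relates the
decrease directly to the transitions in the Markov process:

Lemma 5.2. For all $0 \le l_1 < l_2$, we have

$$\varphi(l_1) - \varphi(l_2) = \sum_{u \in L_{l_1},\, v \in L_{l_2}} p(u,v)\,(A(u) - A(v))^2.$$

Proof. By Lemma 5.1, we have

$$\sum_{u,v} p(u,v)A(u)A(v) = \sum_{v} p(v)A(v) \sum_{u} \frac{p(u,v)}{p(v)}A(u) = \sum_{v} p(v)A(v)^2,$$

where $u$ ranges over $L_{l_1}$ and $v$ over $L_{l_2}$. Therefore, we have

$$\begin{aligned}
\varphi(l_1) - \varphi(l_2) &= \sum_{u,v} p(u,v)A(u)^2 - \sum_{v} p(v)A(v)^2 && \text{since } p(u) = \textstyle\sum_v p(u,v) \\
&= \sum_{u,v} p(u,v)\big(A(u)^2 + A(v)^2\big) - 2\sum_{v} p(v)A(v)^2 && \text{since } p(v) = \textstyle\sum_u p(u,v) \\
&= \sum_{u,v} p(u,v)\big(A(u)^2 + A(v)^2 - 2A(u)A(v)\big) && \text{by Lemma 5.1} \\
&= \sum_{u,v} p(u,v)\,(A(u) - A(v))^2.
\end{aligned}$$

□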
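
The identity of Lemma 5.2 can also be checked numerically. The following is a minimal sketch under stated assumptions: the chain is a randomly generated layered Markov process with strictly positive transition probabilities and a uniform start on two source states, the helper names are hypothetical, and only consecutive layers ($l_2 = l_1 + 1$) are tested. Exact rational arithmetic is used so that the two sides can be compared for equality.

```python
from fractions import Fraction
import random

def random_stochastic(rows, cols, rng):
    """Random row-stochastic matrix with strictly positive rational entries."""
    m = []
    for _ in range(rows):
        w = [rng.randint(1, 9) for _ in range(cols)]
        m.append([Fraction(x, sum(w)) for x in w])
    return m

def potential_check(sizes, seed=0):
    """Check phi(l) - phi(l+1) = sum_{u,v} p(u,v) (A(u) - A(v))^2 for every l."""
    rng = random.Random(seed)
    half = Fraction(1, 2)
    p = [[half, half]]                 # p(v) for v in layer 0 = {s_0, s_1}
    p_s0 = [[half, Fraction(0)]]       # Pr[X_0 = s_0 and X_l = v]
    steps = []
    for a, b in zip(sizes, sizes[1:]):
        t = random_stochastic(a, b, rng)
        steps.append(t)
        p.append([sum(p[-1][u] * t[u][v] for u in range(a)) for v in range(b)])
        p_s0.append([sum(p_s0[-1][u] * t[u][v] for u in range(a)) for v in range(b)])
    # A(v) = Pr[X_0 = s_0 | X_l = v] - 1/2 and phi(l) = sum_v p(v) A(v)^2.
    A = [[p_s0[l][v] / p[l][v] - half for v in range(len(p[l]))]
         for l in range(len(p))]
    phi = [sum(p[l][v] * A[l][v] ** 2 for v in range(len(p[l])))
           for l in range(len(p))]
    for l, t in enumerate(steps):
        # For consecutive layers, p(u, v) = p(u) * Pr[v | u].
        rhs = sum(p[l][u] * t[u][v] * (A[l][u] - A[l + 1][v]) ** 2
                  for u in range(len(t)) for v in range(len(t[0])))
        if phi[l] - phi[l + 1] != rhs:
            return False
    return True

print(potential_check([2, 4, 4, 4, 2]))
```

The check does not use the symmetry of the flow graph, which matches the proof above: only the Markov property and the marginalization identities for $p(u)$ and $p(v)$ are needed.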



Recall that we want to bound the possible flow from $s_0$ to $t_1$ by $O(p(s_0,t_1))$. We may assume
that $p(s_0,t_1) < \frac14$ (i.e. $A(t_1) < 0$), since otherwise the bound is trivial. Note that, by symmetry,
we have $A(s_0) = -A(s_1)$ and $A(t_0) = -A(t_1)$. Since we have only two sources and two sinks, this
implies

$$\varphi(0) - \varphi(N) = A(s_0)^2 - A(t_1)^2 = \tfrac14 - A(t_1)^2 = \tfrac14 - \big(2p(s_0,t_1) - \tfrac12\big)^2 \le 2p(s_0,t_1).$$

Therefore, to prove Theorem 4.4 it suffices to show



Lemma 5.3. For every $k > 0$ there is some constant $C = C(k)$ such that for any $G$ as above, with
$A(t_1) < 0$, there is a cut in $G$ separating $s_0$ from $t_1$ of capacity at most $C \cdot (\varphi(0) - \varphi(N))$.

Symmetry implies that $k$ is even, and the case of $k = 2$ is fairly trivial. We will first consider
the simpler case of $k = 4$, while the general case is shown in Section 6.

5.2 Treewidth 2 

Let us start with the case of $k = 4$, or $r = 2$. This shows some of the main ideas in the analysis for
larger $k$, while still being relatively simple. It is also a non-trivial special case, since it covers
series-parallel graphs. A more careful analysis would yield a smaller constant $C$ (we did not optimize).

Lemma 5.4. Lemma 5.3 holds for $k = 4$ and $C = 100$.

Proof. We may assume that <.p{N) > Otherwise, since the capacity of sq is ^, the cost of 

simply cutting the outgoing edges of sq is ^ = C/200 < G{ip{0) — ip{N)). Let us denote A* = 
A{to) = y^ip{N){> Y57f)- Let us start by cutting all edges {u,v) for which |^(m) - A{v)\ > ^A*. 
By Lemma 5.2, the total capacity of these edges is at most 

J2 E Piu,v)<ijL^fYl E p{u,v)iAiu) - Aiv)f 

1=1 lie-L;_i,D_Lj 1=1 u£Li_i,VL^ 

\Aiu}-A(v)\>^A* \A(u)-A(v)\>^A* (7) 

t 

< i2j^f - 1) - vim = i2j^?im - viN)). 
1=1 

Let us examine the rest of the graph. By symmetry, and since $\varphi$ is monotone decreasing, every
layer $l$ must contain some vertex $v_l$ such that $A(v_l) \ge A(t_0) = A^*$, and a corresponding vertex $\bar{v}_l$
with $A(\bar{v}_l) = -A(v_l)$. Consider the inner two vertices $u_l, \bar{u}_l$ (with $A(v_l) \ge A(u_l) = -A(\bar{u}_l) \ge 0$).
Since there are no direct edges from vi-i to vi (the edge would be longer than ^A*), the flow 
must travel along paths using the vertices {ui,ui}i. Since we have cut long edges, the A{-) values 
of these paths must pass through the interval [— Let li be the first such interval for 
which there is flow from sq to n^j. By a similar argument, any flow from sq to Ui-^ must pass 
through the interval [|yl*,|^*] (before layer li). Let us take the last such layer, say (it can 
be checked that all flow to u/^ and ui-^ must pass through ui^). To cut all flow to ui^,ui^, it 
suffices to remove vertex ui^, or equivalently, to cut all outgoing edges from n^,,. Note that for all 
vertices w € Li-^ we have \A{w) - A{uiq)\ > jA*, since A{uif^) e A{vi^) > A*, and 

— ^(^^i) ^ ^(^^i) ^ J A*. Hence, by Lemma |5.2| , the cost of cutting vertex ui^ is at most 

pK)) = Yl P(.uio,w) < Yl P(.uio,w){A{ui^) - A{w)f 

w£Li^ weLi-^ (8) 

It is easy to see that any more flow from sq to ti must start at vi^ for some layer I2 > ^i, 
and so we can repeat the above argument, cutting vertices with A{-) value in {^A* , |], and paying 
(2^)^(97(^1) — <f{li+i)) each time for non-overlapping intervals [Iq, h], [Z2, ^3], • • • , [ImJm+i], until we 
have severed all flow. Combining this with the cost incurred in (^), we can bound the total capacity 
of edges cut by 2{^)^{ip{0) - ^{N)) < 100((^(0) - f{N)). □ 

To summarize the above approach, our cutting technique follows a two phase process. First, we 
cut all "long" edges, which helps us isolate individual paths in the flow. Then, we isolate portions 
of the graph where the individual paths have a large shift in A(-) value (e.g. move from the interval 
[jA*, J A*] to the interval [— yA*, ^^*]), and cut such paths by removing a single vertex, charging 
to the difference in potential along that portion of the graph. 

There are a number of technical difficulties involved in extending this argument to work for
larger $k$. First, we cannot isolate specific intervals through which flow must pass in an isolated
path, as these depend on the $A(\cdot)$ values of other vertices in nearby layers. Secondly, even after
cutting a path at some node, we are not guaranteed that there is no other path which routes
flow around the node we cut. Rather than decompose the graph into isolated paths, we cut in
several (roughly $k$) phases using a cut-and-cluster approach. Namely, after cutting, we "cluster"
together all vertices (or clusters from the previous phase) in a single layer that are close together
in $A(\cdot)$ value, and ensure that the number of clusters per layer which can contain flow from $s_0$ to
$t_1$ decreases after every phase.

Unfortunately, our threshold for clustering vertices increases by roughly a $k^2$ factor after every
phase, thus ultimately incurring a loss which is exponential in $k$ (or doubly-exponential in $r$). We
note that while this does not match our $\Omega(k)$ $(= \Omega(2^r))$ lower-bound, the lower-bound at least shows
that we cannot expect to get any "reasonable" dependence on $r$ (say, $O(\log r)$) with our rounding.

6 Performance guarantee for general bounded treewidth 



Let us begin with a simple lemma which was implicit in the analysis of Lemma 5.4:

Lemma 6.1. Let $\{[l_j, l'_j]\}_j$ be a sequence of non-overlapping intervals for integers $0 \le l_j < l'_j \le N$
and let $W_j \subseteq L_{l_j}$ be sets of nodes in layer $L_{l_j}$ such that for every node $w \in W_j$ and every node
$x \in L_{l'_j}$ we have $|A(w) - A(x)| \ge \rho$ for some $\rho > 0$. Then the cost of removing all $w \in W_j$ for every
$j$ (i.e. cutting all outgoing edges from $w$) is at most $\frac{1}{\rho^2}(\varphi(0) - \varphi(N))$.

As we will use this lemma repeatedly, for a set of edges (resp. nodes) $T$, we will call the value
$p(T)/(\varphi(0) - \varphi(N))$ the relative cost of $T$, where $p(T)$ is the total capacity of the edges (resp.
nodes) in $T$. Thus our goal will be to find a cut of constant relative cost.

Let us introduce some terminology and notation: 

Definition 6.2. In the context of this section, the distance between two nodes $u,v$ will always
refer to the value $|A(u) - A(v)|$, which we will also call the length of $(u,v)$ when $(u,v)$ is an edge.
For two non-overlapping clusters (defined below), we define the distance between them to be the
minimum distance between two nodes, one in each cluster.

Definition 6.3. For any $\varepsilon > 0$, an $\varepsilon$-cluster is a set of nodes belonging to a single layer, such
that when ordered by their respective $A(\cdot)$-values, every two consecutive nodes are at distance at
most $\varepsilon$ from each other. For any cluster $X$, we will denote $A^+(X) = \max_{v \in X} A(v)$ and
$A^-(X) = \min_{v \in X} A(v)$, and we will refer to the value $A^+(X) - A^-(X)$ as the width of $X$.

Note that the width of any $\varepsilon$-cluster is at most $(k-1)\varepsilon$.
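
To make Definition 6.3 concrete, here is a minimal sketch that groups the $A(\cdot)$-values of one layer into clusters. It assumes that the clusters of interest are the maximal $\varepsilon$-clusters (the definition itself does not require maximality), and the function names and sample values are hypothetical.

```python
from fractions import Fraction

def epsilon_clusters(a_values, eps):
    """Split the A(.)-values of a single layer into maximal eps-clusters:
    consecutive values (in sorted order) stay together iff their gap is <= eps."""
    clusters = []
    for a in sorted(a_values):
        if clusters and a - clusters[-1][-1] <= eps:
            clusters[-1].append(a)
        else:
            clusters.append([a])
    return clusters

def width(cluster):
    """A^+(X) - A^-(X) for a cluster X given by its A(.)-values."""
    return max(cluster) - min(cluster)

# Sample layer with k = 6 nodes (values chosen only for illustration).
layer = [Fraction(x, 100) for x in (-50, -44, -40, 10, 13, 50)]
eps = Fraction(6, 100)
clusters = epsilon_clusters(layer, eps)
print([len(c) for c in clusters], [width(c) for c in clusters])
```

Every cluster produced this way has at most $k$ nodes and consecutive gaps of at most $\varepsilon$, so its width is at most $(k-1)\varepsilon$, in line with the note above.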

Definition 6.4. In a Markov flow graph as above, with some edges already cut, we will call a
cluster $X$ viable if there is any capacity-respecting flow in the remaining graph from $s_0$ to $t_1$ which
goes through at least one node of $X$. For any clustering of the graph, we will define the clustered
capacity to be the maximum number of viable clusters per layer, over all layers.



Let us now prove Lemma 5.3.

Proof of Lemma 5.3. Let us assume that $A(t_1) \le -\frac13$ (recall that we've assumed $A(t_1) < 0$).
Otherwise, simply cutting the outgoing edges of $s_0$ yields a cut of relative cost
$\frac12/(\varphi(0) - \varphi(N)) < \frac12/(\frac14 - \frac19) = \frac{18}{5}$.
As discussed earlier, we will proceed in k phases. In each phase, we will reduce the clustered 
capacity of the graph. Each phase will consist of first cutting some clusters (i.e. removing all 
outgoing edges from the nodes in these clusters), and then increasing the size of certain other 
clusters (by increasing the threshold for clustering). 

We begin by first cutting all edges of length at least $\varepsilon_0$ (for some $\varepsilon_0 > 0$ to be determined soon).
By Lemma 6.1, the relative cost of this cut is at most $1/\varepsilon_0^2$. At the end of each phase $j$, we will
cluster the nodes with clustering threshold $\varepsilon_j = (12k^2)^j \varepsilon_0$, while we will require that $k\varepsilon_{k-1} \le \frac16$.
Thus we set $\varepsilon_0 = 1/(6k(12k^2)^{k-1})$. As we shall see, the relative cost of the cut at phase $j$ will be
at most $1/(k\varepsilon_{j-1})^2$. Thus, the total relative cost of our cut will be at most

$$\frac{1}{\varepsilon_0^2} + \sum_{j=1}^{k} \frac{1}{(k\varepsilon_{j-1})^2}.$$

Before describing and analyzing the individual phases, let us note that at phase $j$ every cluster
has width at most $(k-1)\varepsilon_{j-1}$. Since we have cut all edges of length at least $\varepsilon_0$, for two clusters
$X_1, X_2$ in consecutive layers, there can be flow from $X_1$ to $X_2$ only if $A^-(X_2) - A^+(X_1) < \varepsilon_0$ and
$A^-(X_1) - A^+(X_2) < \varepsilon_0$. In particular, when there is such flow, we have

$$\max\{|A^+(X_2) - A^+(X_1)|,\ |A^-(X_2) - A^-(X_1)|\} < (k-1)\varepsilon_{j-1} + \varepsilon_0 \le k\varepsilon_{j-1}. \qquad (9)$$
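
The threshold schedule can be tabulated directly. The sketch below assumes the schedule $\varepsilon_j = (12k^2)^j\varepsilon_0$ with $\varepsilon_0 = 1/(6k(12k^2)^{k-1})$, and it reads the total relative cost as the cost $1/\varepsilon_0^2$ of the initial cut plus the per-phase bounds $1/(k\varepsilon_{j-1})^2$ for $j = 1, \ldots, k$; that reading, and the function names, are assumptions made for illustration.

```python
from fractions import Fraction

def thresholds(k):
    """eps_0, ..., eps_{k-1} with eps_j = (12 k^2)^j * eps_0 and k * eps_{k-1} = 1/6."""
    eps0 = Fraction(1, 6 * k * (12 * k * k) ** (k - 1))
    return [(12 * k * k) ** j * eps0 for j in range(k)]

def total_relative_cost(k):
    """1/eps_0^2 for the long edges plus 1/(k * eps_{j-1})^2 for phases j = 1..k."""
    eps = thresholds(k)
    return 1 / eps[0] ** 2 + sum(1 / (k * eps[j - 1]) ** 2 for j in range(1, k + 1))

k = 4
eps = thresholds(k)
# The requirement k * eps_{k-1} <= 1/6 holds with equality for this choice of eps_0.
print(k * eps[-1])
# The last inequality in (9): (k - 1) * eps_{j-1} + eps_0 <= k * eps_{j-1} in every phase.
print(all((k - 1) * e + eps[0] <= k * e for e in eps))
print(total_relative_cost(k))
```

The dominant term is $1/\varepsilon_0^2$, which grows like $k^{O(k)}$; this is the source of the loss that is exponential in $k$.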

We now proceed by induction on the clustered capacity. If the clustered capacity is 1, then there
is a single "path" of clusters from $s_0$ to $t_1$. Let $j$ be the current phase ($1 \le j \le k$). By our choice
of $\varepsilon_{j-1}$, and by (9), the value $A^+(X)$ of any ($\varepsilon_{j-1}$-)cluster in the path can increase or decrease by
at most $k\varepsilon_{k-1} \le \frac16$ at each step. Since this value starts at $A(s_0) = \frac12$, and ends at $A(t_1) \le -\frac13$,
at some point it must pass through the interval $[0, \frac16]$. Call this layer $l_1$, and the corresponding
cluster $X_{l_1}$. Now $A^+(X_{l_1}) \le \frac16$, and since the width of this cluster is at most $\frac16$, we also have
$A^-(X_{l_1}) \ge -\frac16$. Thus all nodes in the cluster are at distance at least $\frac16$ from $t_0$ and $t_1$. Thus, by
Lemma 6.1, the relative cost of cutting all flow along the path by removing $X_{l_1}$ is at most 36.

Now suppose the clustered capacity is $k'$ for some $1 < k' \le k$, and let $j$ denote the current
phase. For any given layer, if the number of viable $\varepsilon_{j-1}$-clusters is strictly less than $k'$, then we are
done with that particular layer. Otherwise, if there are $k'$ viable clusters in a layer and any two
of them are at distance at most $\varepsilon_j$ from each other, then again we are done with that layer, since
at the end of the phase the two clusters will be merged (possibly along with additional clusters)
into a single $\varepsilon_j$-cluster. Thus, we only need to reduce the number of viable clusters in layers which
contain $k'$ distinct viable $\varepsilon_{j-1}$-clusters whose pairwise distances are all greater than $\varepsilon_j$.

Let us denote the first such layer by $l_1$, and the viable clusters by $X_1^{l_1}, \ldots, X_{k'}^{l_1}$, in increasing
order of $A(\cdot)$ values. Note that any viable cluster $X_1$ must have a corresponding viable cluster $X_1'$
in the subsequent layer satisfying (9). Moreover, for any two $\varepsilon_{j-1}$-clusters $X_1, X_2$ in a layer $L_l$ such
that $A^+(X_1) < A^-(X_2) - \frac{\varepsilon_j}{3}$ with flow into clusters $X_1', X_2'$, respectively, in the subsequent layer,
there cannot be any flow from $X_1$ to $X_2'$ or from $X_2$ to $X_1'$. Indeed, the distance from, say, $X_1$ to
$X_2'$ is greater than $\frac{\varepsilon_j}{3} - k\varepsilon_{j-1} > \varepsilon_{j-1} \ge \varepsilon_0$. In particular, for any layer containing $k'$ viable clusters
with pairwise distance greater than $\frac{\varepsilon_j}{3}$, each cluster must have flow into exactly one cluster in
the subsequent layer (there cannot be more, since no layer contains more than $k'$ viable clusters in
this phase).

Let us denote by $l_2$ the first layer after $l_1$ in which some pair of adjacent viable clusters $X_i^{l_2}, X_{i+1}^{l_2}$
are at distance at most $\frac{\varepsilon_j}{3}$. By the above argument, all viable flow in the portion of the graph from
$L_{l_1}$ to $L_{l_2}$ flows through $k'$ disjoint cluster-paths $X_i^{l_1} \to X_i^{l_1+1} \to \cdots \to X_i^{l_2}$ (for $i = 1, \ldots, k'$).

Therefore, to reduce the number of viable clusters in all layers $L_{l_1}, \ldots, L_{l_2}$, it suffices to cut just one
cluster $X_i^l$ for some $i \in \{1, \ldots, k'\}$ and some $l_1 \le l \le l_2$. Consider the pair of clusters $X_i^{l_2}, X_{i+1}^{l_2}$.
Since these clusters are at distance at most $\frac{\varepsilon_j}{3}$ and the corresponding clusters $X_i^{l_1}, X_{i+1}^{l_1}$ are at
distance at least $\varepsilon_j$, either $A^+(X_i^{l_2}) - A^+(X_i^{l_1}) \ge \frac{\varepsilon_j}{3}$, or $A^-(X_{i+1}^{l_1}) - A^-(X_{i+1}^{l_2}) \ge \frac{\varepsilon_j}{3}$. Without loss
of generality, suppose the former.

Now consider the open real interval $(A^+(X_i^{l_1}), A^+(X_i^{l_2}))$. It has length at least $\frac{\varepsilon_j}{3}$, and contains
at most $k-1$ values in $\{A(v) \mid v \in L_{l_2}\}$ (at least one node $v$ in layer $L_{l_2}$ has $A(v) = A^+(X_i^{l_2})$).
Therefore there is an open subinterval $(a_0, a_1)$ of length at least $\frac{\varepsilon_j}{3k} = 4k\varepsilon_{j-1}$ containing none of
these values. By (9), there must be some layer $L_l$ (for some $l_1 \le l \le l_2$) for which
$A^+(X_i^l) \in (a_0 + 2k\varepsilon_{j-1}, a_0 + 3k\varepsilon_{j-1})$. Since $X_i^l$ has width at most $k\varepsilon_{j-1}$, it must also satisfy
$A^-(X_i^l) \in (a_0 + k\varepsilon_{j-1}, a_0 + 3k\varepsilon_{j-1})$. In particular, all nodes in $X_i^l$ are at distance at least $k\varepsilon_{j-1}$ from all
nodes in layer $L_{l_2}$. We can now repeat this argument for layers $> l_2$, until we have exhausted all
layers in the graph, and by Lemma 6.1, the relative cost of the cut will be at most $1/(k\varepsilon_{j-1})^2$, as
required. □

7 A Lower Bound for the Rounding Algorithm



In this section, we give a lower bound on the quality of approximation of algorithm SC-Round. It
is not known whether this can be translated into an integrality gap for our LP (in fact, for our
construction, the integrality gap is 1). We start by showing that the reduction to Markov flows
discussed in Section 4 goes both ways. Specifically, we show the following lemma:

Lemma 7.1. Let $r > 0$ be a positive integer, let $H = (L_0, \ldots, L_N, E)$ be a symmetric Markov flow
graph with sources $L_0 = \{s_0, s_1\}$ and sinks $L_N = \{t_0, t_1\}$ with at most $2^r$ states per layer, and let
$F$ be a capacity-respecting flow from $s_0$ to $t_1$. Then there is a graph $G$ of pathwidth at most $2r - 1$
with one demand pair $(s,t)$ and a feasible (but not necessarily optimal) solution $\{y_I\}$ to $SC_{2r}(G)$
such that $H$ represents the distribution on assignments found by algorithm SC-Round for $G$ and
$\{y_I\}$, and $y_{s\neq t} \ge |F|$ (the amount of flow in $F$).

Proof. Let $F_{sym} = \frac12(F + \bar{F})$ (where $\bar{F}$ is the flow from $s_1$ to $t_0$ corresponding symmetrically to $F$).
By the symmetry of $H$, this is also a capacity-respecting (multi-)flow. Since the capacities in $H$
are themselves a flow, the residual capacity in $H$ can be decomposed into two multiflows $F_{\neq}$ and
$F_{=}$ (between opposite and same-side terminals, respectively). As before, we may assume that both
multiflows are symmetric. Thus, $H$ can be decomposed into a sum of flows $F_{sym} + F_{\neq} + F_{=}$ from
$\{s_0, s_1\}$ to $\{t_0, t_1\}$ where the total amount of flow between opposite terminals is
$|F_{sym}| + |F_{\neq}| \ge |F_{sym}| = |F|$.

Now define a graph $G$ on vertices $\bigcup_{l=0}^{N} B'_l$, where $B'_0 = \{s\}$, $B'_N = \{t\}$, and for all $0 < l < N$,
$|B'_l| = r$, and some edge set which admits a path-decomposition with bags $B_l = B'_l \cup B'_{l+1}$. Then for
an appropriate symmetry-preserving correspondence between nodes of $H$ and local assignments to
vertices of $G$, every path from $\{s_0, s_1\}$ to $\{t_0, t_1\}$ in $H$ corresponds to a full assignment
$f : \bigcup_l B'_l \to \{0,1\}$. Thus the flow decomposition above can be viewed as a symmetric distribution on paths,
which corresponds to a symmetric distribution on $\{0,1\}$ assignments in $G$. Let $\{y_f\}$ be the (level
$n$) Sherali-Adams solution corresponding to this distribution. Note that $y_{s\neq t} = |F_{sym}| + |F_{\neq}| \ge |F|$.
It is also not hard to see that algorithm SC-Round given the path decomposition and any scaling of
$\{y_f\}$ will produce a distribution on assignments corresponding to the flow graph $H$. Thus, letting
$\{y_f\}$ be an appropriate scaling (satisfying constraint (5)) completes the proof. □

By the above lemma, to get a lower-bound for our rounding which is exponential in the treewidth
of the graph, it suffices, for every even integer $k \ge 4$, to construct a Markov flow graph as in
Theorem 4.4 with at most $k$ nodes per layer, which admits $\Omega(k)\cdot(\frac12 + A(t_1))$ units of
capacity-respecting flow from $s_0$ to $t_1$ ($\frac12 + A(t_1)$ is the probability that the Markov chain starts and ends
at opposite terminals $(s_0,t_1)$ or $(s_1,t_0)$; see Section 5.1 for the definition of $A(\cdot)$). Let us see such
a construction now.



Construction. For every sufficiently large integer $N$ and $\varepsilon \in \big(0, \frac{1}{2(N+k)}\big)$, let $H_k(N,\varepsilon)$ be the
following layered capacitated digraph: As before, the nodes will consist of layers $L_0, L_1, \ldots, L_N$
where $L_0 = \{s_0, s_1\}$ and $L_N = \{t_0, t_1\}$. For every $0 < j < N$, we have $L_j = \{v_0^j, v_1^j, \ldots, v_{k-1}^j\}$.

For every $i = 1, \ldots, k-2$ we add (directed) edges $(s_0, v_i^1)$ and $(s_1, v_{k-1-i}^1)$, each with capacity
$2\varepsilon(k-1-i)/(k-1)$. We also add edges $(s_0, v_0^1)$ and $(s_1, v_{k-1}^1)$, each with capacity
$\frac12 - (k-2)\varepsilon$. Next, between every two consecutive layers $L_j, L_{j+1}$ (for $1 \le j \le N-2$) we add
the following directed edges:

• For all $i = 1, \ldots, k-2$ add edges $(v_{i-1}^j, v_i^{j+1})$ and $(v_{k-i}^j, v_{k-1-i}^{j+1})$ each with capacity $\varepsilon$.

• Add edges $(v_0^j, v_0^{j+1})$ and $(v_{k-1}^j, v_{k-1}^{j+1})$ each with capacity $\frac12 - (j+k-2)\varepsilon$.

• Add edges $(v_1^j, v_1^{j+1})$ and $(v_{k-2}^j, v_{k-2}^{j+1})$ each with capacity $j\varepsilon$.

Finally, for $i = 0, \ldots, \frac{k}{2} - 1$ add edges $(v_i^{N-1}, t_0)$ and $(v_{k-1-i}^{N-1}, t_1)$ with the full capacity of the
respective layer $L_{N-1}$ node (i.e. capacity $2\varepsilon$ for $i = 2, \ldots, k-3$; capacity $\frac12 - (N+k-4)\varepsilon$ for
$i = 0, k-1$; and capacity $N\varepsilon$ for $i = 1, k-2$).



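From the above construction, the definition of $A(\cdot)$, and Lemma 5.1, the following claim follows
immediately:

Claim 7.2. In flow graph $H_k(N,\varepsilon)$ we have

1. for all layers $j = 1, \ldots, N-1$ and all $i = 0, \ldots, k-1$ we have $A(v_i^j) = \frac12 - \frac{i}{k-1}$,

2. $A(t_0) = \frac12 - \big(N + (\frac{k}{2}-2)(\frac{k}{2}+1)\big)\frac{2\varepsilon}{k-1}$ and $A(t_1) = -\frac12 + \big(N + (\frac{k}{2}-2)(\frac{k}{2}+1)\big)\frac{2\varepsilon}{k-1}$.

Thus, if we take $N = \omega(k^2)$ then

$$\tfrac12 + A(t_1) = (1 + o(1))\,\frac{2N\varepsilon}{k-1}. \qquad (10)$$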
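On the other hand, consider the flow (written as a weighted sum of paths) $F = \varepsilon \cdot \sum_{j=1}^{N-k} p_j$,
where path $p_j$ is defined as

$$p_j = s_0 \to v_0^1 \to \cdots \to v_0^j \to v_1^{j+1} \to \cdots \to v_{k-2}^{j+k-2} \to v_{k-2}^{j+k-1} \to \cdots \to v_{k-2}^{N-1} \to t_1.$$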
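
The construction and the flow $F$ lend themselves to a mechanical check. The following sketch builds the capacities of $H_k(N,\varepsilon)$ under our reading of the construction above (source edges of capacity $2\varepsilon(k-1-i)/(k-1)$, diagonal edges of capacity $\varepsilon$, and the two kinds of straight edges), verifies flow conservation, computes the values $A(\cdot)$, and tests that $F$ respects the capacities. The encoding of node $v_i^j$ as the pair `(j, i)` and all function names are assumptions made for illustration.

```python
from fractions import Fraction

HALF = Fraction(1, 2)

def build_h(k, n, eps):
    """Capacities {(u, v): c} of H_k(N, eps); the pair (j, i) stands for node v_i^j."""
    cap = {}
    for i in range(1, k - 1):                      # sources into layer 1
        c = 2 * eps * (k - 1 - i) / (k - 1)
        cap[('s0', (1, i))] = c
        cap[('s1', (1, k - 1 - i))] = c
    cap[('s0', (1, 0))] = cap[('s1', (1, k - 1))] = HALF - (k - 2) * eps
    for j in range(1, n - 1):                      # layer j into layer j + 1
        for i in range(1, k - 1):
            cap[((j, i - 1), (j + 1, i))] = eps
            cap[((j, k - i), (j + 1, k - 1 - i))] = eps
        cap[((j, 0), (j + 1, 0))] = cap[((j, k - 1), (j + 1, k - 1))] = HALF - (j + k - 2) * eps
        cap[((j, 1), (j + 1, 1))] = cap[((j, k - 2), (j + 1, k - 2))] = j * eps
    inflow = {}
    for (_, v), c in cap.items():
        inflow[v] = inflow.get(v, 0) + c
    for i in range(k // 2):                        # layer N - 1 into the sinks
        cap[((n - 1, i), 't0')] = inflow[(n - 1, i)]
        cap[((n - 1, k - 1 - i), 't1')] = inflow[(n - 1, k - 1 - i)]
    return cap

def analyse(cap, n):
    """Return (conserved, A), where A[v] = Pr[X_0 = s0 | X_l = v] - 1/2."""
    def layer(v):
        if v in ('s0', 's1'):
            return 0
        return n if v in ('t0', 't1') else v[0]
    inflow = {'s0': HALF, 's1': HALF}              # p(v) for every node
    outflow = {}
    from_s0 = {'s0': HALF, 's1': Fraction(0)}      # Pr[X_0 = s0 and X_l = v]
    for (u, v), c in sorted(cap.items(), key=lambda e: layer(e[0][0])):
        outflow[u] = outflow.get(u, 0) + c
        inflow[v] = inflow.get(v, 0) + c
        # Markov property: the s0-mass at u is split in proportion to the capacities.
        from_s0[v] = from_s0.get(v, 0) + from_s0[u] * c / inflow[u]
    conserved = all(inflow[u] == out for u, out in outflow.items())
    A = {v: from_s0[v] / p - HALF for v, p in inflow.items()}
    return conserved, A

def flow_f(k, n, eps):
    """Edge loads of F = eps * (p_1 + ... + p_{N-k})."""
    load = {}
    for j in range(1, n - k + 1):
        path = ['s0'] + [(l, 0) for l in range(1, j + 1)]
        path += [(j + i, i) for i in range(1, k - 1)]
        path += [(l, k - 2) for l in range(j + k - 1, n)] + ['t1']
        for e in zip(path, path[1:]):
            load[e] = load.get(e, 0) + eps
    return load

k, n = 4, 40
eps = Fraction(1, 4 * (n + k))
cap = build_h(k, n, eps)
conserved, A = analyse(cap, n)
load = flow_f(k, n, eps)
respects = all(load[e] <= cap.get(e, 0) for e in load)
flow_value = sum(c for (u, _), c in load.items() if u == 's0')
print(conserved, respects, flow_value == (n - k) * eps)
print(flow_value / (HALF + A['t1']))
```

The last line prints the ratio between $|F|$ and the probability $\frac12 + A(t_1)$ of starting and ending at opposite terminals, for one small choice of $k$ and $N$.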



It can readily be checked that $F$ is a capacity-respecting flow in $H_k(N,\varepsilon)$ and that $|F| = (N-k)\varepsilon$.
Thus, by (10), for $N = \omega(k^2)$ we have $|F| \ge (1 - o(1))\frac{k-1}{2}\big(\frac12 + A(t_1)\big)$, which is what we wanted
to show.

A NP-hardness for pathwidth 2



Theorem A.1. Sparsest-Cut (with general demands) is NP-hard even on graphs of pathwidth 2.

Proof. The theorem follows from the following reduction from Max-Cut. Let graph $G = (V,E)$
be an instance of Max-Cut. Construct an instance of Sparsest-Cut on the graph $K_{2,n}$ as follows:
identify every vertex $v_i \in V$ with a corresponding vertex $v_i'$ in the new graph. For every edge
$(v_i,v_j) \in E$ add a demand pair $(v_i', v_j')$ with unit demand. Add two vertices $s, t$ and edges $\{(s,v_i')\}_i$
and $\{(t,v_i')\}_i$. Finally, make $(s,t)$ a demand pair with demand $n^2$. Consider some cut $(S,T)$ in the
new graph. If $s \in S$ and $t \in T$ then the number of cut edges is exactly $n$. Thus the sparseness of
the cut is exactly $n/D(S,T) = n/(n^2 + |E_G(S \setminus \{s\}, T \setminus \{t\})|)$. Therefore the sparsest cut that
separates $s$ from $t$ corresponds exactly to the max cut in $G$. It remains to show that the sparsest
cut in the new graph must separate $s$ from $t$. Indeed, if $(S',T')$ is a cut for which $s,t \in S'$, then
the sparseness of the cut is

$$\frac{2|T'|}{D(S',T')} = \frac{2|T'|}{|E_G(S'\setminus\{s,t\},T')|} \ge \frac{2}{n} > \frac{n}{n^2 + |E_G(S'\setminus\{s,t\},T')|} = \frac{n}{D(S'\setminus\{t\},\, T'\cup\{t\})}.$$

□
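
On small instances the reduction can be checked by brute force. The sketch below builds the $K_{2,n}$ instance for a given graph, enumerates all cuts, and compares the demand separated by the sparsest cut with the Max-Cut value. Unit capacities on the edges of $K_{2,n}$, the function names, and the 5-cycle example are assumptions made for illustration.

```python
from itertools import combinations

def sparsest_cut_of_reduction(n, edges):
    """Brute-force the sparsest cut of the K_{2,n} instance built from (n, edges).

    Vertices 0..n-1 play the role of v'_i; 's' and 't' are the two extra vertices.
    Returns (cut capacity, separated demand, one side of the cut).
    """
    verts = list(range(n)) + ['s', 't']
    cap_edges = [('s', i) for i in range(n)] + [('t', i) for i in range(n)]
    demands = {(u, v): 1 for u, v in edges}
    demands[('s', 't')] = n * n
    best = None
    for size in range(1, len(verts)):
        for side in combinations(verts, size):
            s_side = set(side)
            cut = sum((u in s_side) != (v in s_side) for u, v in cap_edges)
            dem = sum(d for (u, v), d in demands.items()
                      if (u in s_side) != (v in s_side))
            # Compare cut/dem with the best ratio so far by cross-multiplication.
            if dem > 0 and (best is None or cut * best[1] < best[0] * dem):
                best = (cut, dem, s_side)
    return best

def max_cut(n, edges):
    """Brute-force Max-Cut value of the graph on vertices 0..n-1."""
    return max(sum((u in set(side)) != (v in set(side)) for u, v in edges)
               for size in range(n + 1) for side in combinations(range(n), size))

cycle = [(0, 1), (1, 2), (2, 3), (3, 4), (4, 0)]
cut, dem, side = sparsest_cut_of_reduction(5, cycle)
print(cut, dem, ('s' in side) != ('t' in side), dem - 25 == max_cut(5, cycle))
```

For the 5-cycle the sparsest cut separates $s$ from $t$, cuts exactly $n = 5$ edges, and separates $n^2$ plus the Max-Cut value in demand, as the proof predicts.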